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ABSTRACT 

We present a counts-in-cells analysis of clustering in the optically selected 
Durham/UKST and Stromlo-APM Galaxy Redshift Surveys. Minimum variance esti- 
mates of the second moment, skewness (S'3) and kurtosis (S'4) of the count probability 
distribution are extracted from a series of volume limited samples of varying radial 
depth. The corresponding theoretical error calculation takes into account all sources of 
statistical error on the measurement of the moments, and is in good agreement with 
the dispersion over mock redshift catalogues. The errors that we find on S'3 and S'4 are 
larger than those quoted in previous studies, in spite of the fact that the surveys we 
consider cover larger volumes. S'3 varies little with cell size, with values in the range 
1.8 — 2.2 and errors 20%, for cubical cells of side 3 — 20/i~"'^Mpc. Direct measurements 
of S'3 are possible out to ~ 35/i~^Mpc, though with larger errors. A significant deter- 
mination of S'4 is only possible for one scale, / 6/i^^Mpc, with S4 « 5. We compare 
our results with theoretical predictions from A^body simulations of cold dark matter 
universes. Qualitatively, the skewness of the dark matter has the same form as that of 
the galaxies. However, the amplitude of the galaxy S'3 is lower than that predicted for 
the dark matter. Our measurements of S3 are consistent with the predictions of a simple 
model in which initially Gaussian fluctuations in the dark matter evolve gravitationally, 
if a second order bias term is specified, in addition to the traditional linear bias, in order 
to describe the relation between the distribution of galaxies and dark matter. 

Key words: methods: numerical - methods: statistical - galaxies: formation - large- 
scale structure of Universe 



1 INTRODUCTION 

Maps of the local universe have improved dramatically over 
the last decade and permit the clustering pattern of galaxies 
to be quantified on large scales (e.g. Efstathiou et al. 1990(a); 
Maddox et al. 1990; Saunders et al. 1991). Such observations 
can potentially constrain both the nature of the dark matter 
and the statistics of primordial density fluctuations. 

The flrst accurate measurements of the galaxy two-point 
correlation function on scales greater than 10/i~^Mpc indi- 
cated more structure than expected in the simplest form of 
the cold dark matter (CDM) model. This led to variants of 
the CDM model being studied (Efstathiou, Sutherland & 
Maddox 1990). Currently, the most successful CDM model 
is a low density, spatially flat universe with a cosmological 
constant, ACDM. The power spectrum in the ACDM model 
is described by a shape parameter F = 0.2 — 0.3 (in this Letter 
we use the parameterisation of the power spectrum given in 
Efstathiou, Bond & White 1992). If fluctuations in the dark 
matter are normalised so as to reproduce the local abundance 
of hot X-ray clusters (White, Efstathiou & Frenk 1993), the 



power spectrum in the ACDM model is similar to that ob- 
served for galaxies on scales around k ~ 0.05 — 0.2/!,Mpc~^ 
(Gaztanaga & Baugh 1998). On small scales, however, when 
the efi'ects of peculiar velocities are ignored (real space) , the 
dark matter power spectrum has a higher amplitude than 
the galaxy power spectrum (Gaztanaga 1995; Peacock 1997; 
Jenkins et al. 1998). Furthermore, the small scale power spec- 
trum for galaxies is a power law over a decade and a half in 
wavenumber, whereas the dark matter correlation function 
shows considerable curvature. 

Heuristic biasing schemes, in which the galaxy distribu- 
tion is proposed to be a local transformation of the smoothed 
density fleld, have enjoyed a certain degree of success in re- 
producing the observed correlation function (Coles 1993; Cole 
et al. 1998; Mann, Peacock & Heavens 1998; Narayanan et al. 
1999). Progress towards a physical understanding of the pro- 
cesses responsible for producing a bias between the galaxy 
and dark matter distributions has been made using semi- 
analytic models for galaxy formation (Benson et al. 2000a,b; 
Kauffmann et al. 1999). In a ACDM model that reproduces 
the bright end of the fleld galaxy luminosity function, Benson 
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Survey 


cell size 

(h-iMpc) 


R-max 

(/i-iMpc) 


Volume 
(lO^h-^Mpc^) 


Merit - Slogft 


Ngal 


S3 


Si 


Durham/UKST 


3.125 


170 


0.721 


-19.58 


510 


1.94±0.14 


1.5 


Durham/UKST 


6.3125 


180 


0.855 


-19.73 


515 


2.11±0.08 


5.0±3.8 


Durham/UKST 


12.625 


180 


0.855 


-19.73 


515 


1.82±0.21 


3.0 


Durham/UKST 


25. 


170 


0.721 


-19.58 


510 


1.67±1.32 


2.2 


Stromlo-APM 


3.9375 


180 


2.547 


-19.45 


471 


2.07±0.57 


13. 


Stromlo-APM 


8.875 


180 


2.547 


-19.45 


471 


1.89±0.17 


3.1 


Stromlo-APM 


18.1875 


190 


2.995 


-19.58 


465 


2.24±0.29 


8.2 


Stromlo-APM 


36.625 


200 


3.493 


-19.71 


434 


1.41±1.01 





Table 1. Minimum variance estimates of 83 and ^4 in cubical cells from the Durham/UKST and the Stromlo-APM Surveys. The errors 
on ^3 are the Icr theoretical errors for a sample with the volume, geometry and number of galaxies used in the measurement. The relative 
errors on the estimates of S4 are greater than 100 per cent apart from one Durham/UKST value. 



et al. find remarkably good agroomont with both the ampli- 
tude and power law slope of the correlation function of APM 
Survey galaxies (Baugh 1996). If the distortions to the clus- 
tering pattern caused by peculiar motions are included, the 
correlation function of the dark matter is very similar to that 
of the semi-analytic galaxies in the ACDM model, with no 
bias seen on small scales. The correlation function is also in 
good agreement with the measurements from galaxy redshift 
surveys (cf Fig. 1 of Benson et al 2000b). 

The constraints on models of galaxy formation provided 
by the two-point correlation function are somewhat limited. 
The second moment gives a full statistical description of 
the density field only in the case of very weak fluctuations. 
Galaxy clustering can be described in more detail if the 
J-point, volume-averaged, correlation functions, ^j, are ex- 
tracted. If the clustering results from the gravitational am- 
plification of a Gaussian primordial density field, then the 
J-point functions are predicted to follow a hierarchical scal- 
ing, = Sj^2~^. The amplitudes Sj do vary with scale, but 
at a much slower rate than the volume-averaged correlation 
functions (Juszkiewicz, Bouchet & Colombi 1993; Bernardeau 
1994) . This scaling behaviour has been studied extensively for 
cold dark matter in AT-body simulations (e.g. Bouchet, Scha- 
effer & Davis 1991; Baugh, Gaztafiaga & Efstathiou 1995; 
Gaztafiaga & Baugh 1995; Hivon et al. 1995; Colombi et al. 
1996; Szapudi et al. 1999b). 

Fry & Gaztafiaga (1993) proposed a simple bias model, 
based on the assumption that fluctuations in the galaxy dis- 
tribution can be written as a function of the dark matter 
fluctuations, when both fields are smoothed on large scales 
where I2 >C 1- The model gives predictions for the moments 
of the galaxy distribution in terms of the moments for the 
dark matter. To leading order in the dark matter variance, 
the galaxy variance is given by — b^(^^, whore 6 is usu- 
ally called the linear bias. To the same order, an additional or 
second order bias factor, 62, is required to specify the galaxy 
skewness: 

5f^ = i(53°^+3|). (1) 

Gaztafiaga & Fi-ieman (1994) discuss the implications of the 
measurements of Sj from the APM Survey for the bias pa- 
rameters in this model. 

In this Letter, we analyse the clustering in two optically 
selected redshift surveys that sample large volumes of the 
local universe. The Durham/UKST Survey (Ratcliffe et al. 
1998) and Stromlo-APM Survey (Lovoday et al. 1996) arc 
magnitude limited to bj w 17. Gala^xies are sparsely sampled 
from the parent catalogues at a rate of l-in-3 in the case of 
the Durham/UKST Survey and l-in-20 for the Stromlo-APM 



Survey. The Stromlo-APM Survey covers a three times larger 
solid angle than the Durham/UKST Survey. By combining 
the results from the two surveys, the Sj can be determined 
over a large dynamic range in cell size. 



2 COUNTS-IN-CELLS METHODOLOGY 

The technique of measuring the distribution of galaxy counts 
in cells is well established as a means of quantifying large scale 
structure (Peebles 1980). The basis of the method is to throw 
a large number of cells onto the galaxy distribution in order 
to obtain the probabifity distribution of finding N galaxies 
in a cell of a given size /. The moments of the count proba- 
bility distribution arc estimated using the factorial moment 
technique, wiiicii automatically adjusts the moments to com- 
pensate for the sampling of a continuous density field using 
discrete galaxies (Szapudi & Szalay 1993; Szapudi, Meiskin & 
Nichol 1996). The approach that we adopt here differs in two 
respects from most previous work. A similar methodology is 
applied to the PSCz Survey by Szapudi et al. (2000). 

The first difference lies in how the higher order moments 
are extracted from the redshift survey. The count probability 
distribution is measured in a series of volume limited samples 
of varying radial depth drawn from the fiux limited survey. 
The moments obtained for a particular cell volume are com- 
pared between the different volume limited samples and the 
minimum variance estimate is adopted as our measurement 
for this scale. The construction of volume limited samples is 
straightforward: a maximum redshift for the sample is de- 
fined and any galaxy from the flux limited redshift survey 
that would remain visible if displaced out to this redshift is 
included in the sample (see, for example, Hoyle et al. 1999). 

The number density of galaxies in a volume limited sam- 
ple is effectively independent of radial distance, with small 
fluctuations due to large scale structure. This is in direct 
contrast to a flux limited survey, whore the number density 
changes rapidly with radius. To analyse the count distribu- 
tion iu a flux limited catalogue, a weight must be assigned to 
each galaxy to compensate for the radial selection function. 
The analysis of volume limited samples is therefore much 
simpler, and gives equivalent results without introducing any 
biases: moreover, the task of devising an optimal weighting 
scheme and of constructing a suitable estimator of the mo- 
ments to apply to the flux limited sample is avoided (Colombi, 
Szapudi & Szalay 1998). 

This approach does, however, rely upon the assumption 
that galajcy clustering does not depend on luminosity, at least 
over the range of luminosities that we consider in our sam- 
ples (see column 4 of Table 1 for the absolute magnitudes 
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that define the volume hmited samples we analyse). Love- 
day et al. (1995) measured the two-point correlation func- 
tion in redshift space for galaxies selected from the Stromlo- 
APM survey on the basis of absolute magnitude. These au- 
thors found no significant evidence for a difference in cluster- 
ing amplitude when comparing samples over a much broader 
range of absolute magnitudes than we consider in our analy- 
sis. Similar conclusions were reached by Tadros & Efstathiou 
(1996) who analysed the amplitude of the power spectrum 
in different volume limited samples drawn from the same 
survey. A weak effect, at just over the la level, was seen 
only for the deepest sample, corresponding to an absolute 
magnitude of Mtj = -20.3. Hoyle et al. (1999) found that 
the power spectra in volume limited samples drawn from the 
Durham/UKST survey vary by less than the la errors as the 
depth of the sample is changed. Therefore, the approximation 
that the intrinsic clustering in redshift space is the same in 
different volume limited samples is fully justified by previous 
work on the surveys we analyse in this Letter. 

The second difference from previous work is the treat- 
ment of the errors on the measured moments. A theoretical 
calculation of the errors is made using the method described 
by Szapudi, Colombi & Bernardeau (1999a) ^ All the possi- 
ble sources of statistical error are included in the calculation, 
namely the following. 

(i) Finite survey volume. The finite volume of the survey 
means that fluctuations on scales larger than the survey 
volume are not probed at all. In addition, fluctuations on 
scales approaching the maximum dimensions of the survey 
are poorly sampled. 

(ii) Edge effects. The density fleld around galaxies that lie 
close to the survey boundary is not sampled as well as it is 
for a galaxy that is well within the boundary. This is because 
cells are not permitted to straddle the survey boundary. 

(iii) Discreteness. The underlying density fleld is assumed to 
be continuous. Sampling this field discretely with galaxies 
makes an additional contribution to the measured moments. 

(iv) Sampling or measurement errors due to the finite number 
of cells used to construct the count probability distribution. 

The theoretical calculation of the errors requires a num- 
ber of quantities to be specified beforehand. Some of these, 
namely the measured values of the variance and Sj for a given 
ceU size and the sample volume, are estimated directly from 
the sample. The other quantities, the variance over the full 
sample volume and the higher order cumulant correlators, are 
treated as parameters. The errors that we obtain are fairly 
insensitive to reasonable choices for the values of these pa- 
rameters (for a full discussion see Szapudi et al. 1999a). 

The theoretical error calculation has been extensively 
tested for clustered distributions of dark matter using A^- 
body simulations (Colombi et al. 2000). As a further check 
of the calculation, we have compared the results with the 
dispersion found for the moments averaged over 40 mock 
Durham/UKST Survey samples with a redshift limit of 2 = 
0.06, extracted from the Hubble Volume Af-body simulation 
as described in Hoyle et al. (1999). For each cell size in this 
comparison, the measurement is selected from the volume 
limited sample that gives the minimum variance value for 
the higher order moment. The sample that yielded the best 
measurement was found to be the same whether the theoreti- 



* The FORCE package (FORtran for Cosmic Errors) was used to 
compute errors. It is available upon request from its authors, S. 
Colombi (colombi@iap.fr) or IS (szapudi@cita.utoronto.ca). 



-Ill 


1 1 1 1 1 1 1 1 1 1 


i 


-• Q 

: • e 


(a) 
















- 

- 
- 


'■■■-1 >p 

1 1 1 1 1 1 1 1 1 


- 
- 




(b) 












~ ~ ~ 


- 


- 


'■■■■"."^^^ 


- 


- 1 1 1 1 


"-. ^ ^ 

1 1 1 1 1 1 1 1 I 1 1 




0.5 


1 1.5 


2 



logi„(//h-'Mpc) 

Figure 1. The variance of counts in cubical cells. In both panels, 
solid circles show the variance in the Durham/UKST Survey, whilst 
open circles the Stromlo-APM Survey results. In panel (a), we 
show the variance in volume limited samples with Zniax — 0.06. 
The solid line shows an estimate of the variance made from the 
power spectrum measured in the same Durham/UKST sample by 
Hoyle et al. (1999); the dotted lines show the la errors. The crosses 
show the variance for the flux limited Stromlo-APM survey from 
Loveday et al. (1992). The error bars on the Loveday et al. points 
show 95 per cent confidence limits. In panel (b), the circles show 
the best estimates of the variance, extracted from a series of volume 
limited samples. The lines show the variance in redshift space for 
the A'^body simulations discussed in Section 4: the solid line is for 
a simulation with a linear power spectrum described by F = 0.2 
and eg = 1, the dashed line for P = 0.5 and erg = 1 and the dotted 
line for F = 0.5, erg = 0.66 (erg is the rms density fluctuation in 
spheres of radius 8/i~^Mpc). 

cal error or the dispersion over the mock catalogues was used. 
Furthermore, on this scale, the magnitude of the two error es- 
timates agree to better than 10 per cent. The magnitude of 
the theoretical errors is within 50 per cent of the dispersion 
over the mock catalogues on scales that do not give the min- 
imum variance estimates of the higher order moments in a 
particular sample. 



3 RESULTS 

The galaxy count probability distribution is measured in cu- 
bical cells of side 3 — 40/i~^Mpc in a series of volume limited 
samples drawn from the Durham/UKST and Stromlo-APM 
redshift surveys. The limiting redshifts of the samples are in 
the range z ~ 0.05 — 0.08, corresponding to maximum radial 
depths of 140-220ft~^Mpc. The higher order moments are 
calculated from the count probability distribution using the 
factorial moment technique introduced by Szapudi & Szalay 
(1993). In practice, measurement errors, (iv) in the list of sta- 
tistical errors given in Section 2, are negligible in comparison 
to the other contributions, because on the order of 10* cells 
are used to determine the count distribution at each scale. 
This massive oversampling of the density field is achieved 
using the algorithm developed by Szapudi (1998). 

The second moment or variance of the galaxy distri- 
bution is shown in FigEI. In both panels, the filled circles 
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Figure 2 . The skewness extracted from the redshift surveys (filled 
circles show Durham/UKST results, open circles show Stromlo- 
APM results) compared with the three dimensional values inferred 
from the parent angular catalogues (the open triangles show the 
APM Survey results from Gaztafiaga 1994, and the filled triangles 
show the results from the Edinburgh- Durham Southern Galaxy 
Catalogue from Szapudi, Meiksin &; Nichol 1996). 



The minimum variance estimates of S3 from the 
Durham/UKST and Stromlo-APM surveys are listed in Ta- 
ble 1, along with the properties of the volume limited sample 
in which the measurement was made. The errors on 5*3 are 
the 1(7 theoretical errors predicted for a sample of this volume 
and geometry and containing the stated number of galaxies. 
For cubical cells between 3 and 20 /i~^Mpc, we find remark- 
ably little variation in the value of S3, with errors in the 
range 10 — 20 per cent, which again provides further evidence 
against any significant luminosity dependence of clustering. 
We obtain S3 on scales larger than 20ft~^Mpc, but with much 
larger errors. 

When the relative error on the estimate of Sj approaches 
100 per cent, the perturbative techniques used in the error 
calculation break down. Nevertheless, the calculation still re- 
liably indicates that the errors are large and that the mea- 
surement has no significance. The relative errors on S4 are 
estimated to be >100 per cent on all scales in the Stromlo- 
APM survey. There is only one scale where S4 can be reli- 
ably constrained from the Durham/UKST survey. This scale 
is also the one for which S3 is most accurately measured in 
this sample. As we expect this to be the case in general, the 
values for S4 on the same scale as the minimum variance mea- 
surements of S3 are listed in Table |l| These estimates should 
be treated with caution as the errors are large. 



show measurements obtained from the Durham/UKST Sur- 
vey and the open circles show those from the Stromlo-APM 
Survey. Fig. ^(a) shows the variance as a function of cell 
size in volume limited samples extracted from the survey, 
with a maximum redshift of z = 0.06. Fig. 3 of Hoyle et 
al. (1999) shows that the number of galaxies as a function 
of the maximum redshift defining a volume limited sample 
peaks at this redshift for both surveys. These results are in 
good agreement with estimates of the variance made from 
the surveys using different techniques. The solid line shows 
an independent estimate of the variance obtained from the 
the power spectrum of the same volume limited sample from 
the Durham/UKST Survey from Hoyle et al. (1999), for 
wavenumbers k < 0.43/iMpc~^. We have used the approx- 
imate transformation between power spectrum and variance 
given in Peacock (1991). The dotted lines show the la error 
on this estimate, which comes directly from the error on the 
measured power spectrum. The very good level of agreement 
between these different estimates demonstrates that large vol- 
ume cells genuinely measure fluctuations on large scales. Our 
results for a volume limited subsample of the Stromlo-APM 
survey agree well with those obtained from the full magnitude 
limited survey shown by the crosses in Fig. |^(a) (Loveday et 
al. 1992). The error bars on these points show the 95 per 
cent percent confidence limits and are computed under the 
assumption that the distribution of fiuctuations is Gaussian. 

In Fig. ^(b), the points show the best estimates of 
the variance extracted from the two surveys as described 
in Section 2. The best estimates of the variance from the 
Durham/UKST survey come from two samples, with radial 
limits of i?nmx = 170/i"^Mpc and i?max = 180 /i"^Mpc; 
reading from left to right, the first two points and the last 
point in Fig. |l|(b) come from the 7?max = 170/i~'^Mpc sample, 
whilst the third and fourth points come from the -Rmax = 180 
h'^Mpc sample. The smoothness of the locus traced out by 
the points supports our assumption that there is no signifi- 
cant dependence of clustering strength on luminosity over the 
samples considered. The lines in |l|(b) show the variance in a 
set of representative CDM simulations; these simulations are 
discussed in Section 4. 



4 DISCUSSION 

The mean values we obtain for the skewness are in agree- 
ment with those found in shallower redshift surveys, though 
we find errors that are somewhat larger (e.g. Gaztanaga 1992; 
Bouchet et al. 1993; Fry & Gaztanaga 1994; Benoist et al 1999 
and for a comprehensive compilation of results and a more 
exhaustive set of references, see table 1 of Hui & Gaztafiaga 
1999). Moreover, in spite of the relatively large volumes of 
the surveys considered in this Letter, we find that a signifi- 
cant measurement of S4 is only possible at one scale. There 
are two main reasons for the discrepancy in the magnitude 
of the estimated errors. The first is that some previous re- 
sults are quoted as averages over the values of S3 determined 
on different scales, exploiting the relatively flat form of S3 
in redshift space. This leads to smaller errorbars under the 
incorrect assumption that the individual measurements are 
independent. The second reason is that not all of the con- 
tributions to the statistical errors listed in Section 2 were 
considered in previous analyses. 

We have constrained S3 over a wide range of scales, ex- 
tending beyond I ~ 20/i~^Mpc, where simple models for bias 
can be tested most cleanly. Indirect measurements of S3 on 
these scales have been obtained from the IRAS 1.2-Jy Red- 
shift Survey by fitting a parametric functional form for the 
count probability distribution to the measured counts (Kim & 
Strauss 1998). The choice of function is not physically moti- 
vated and the error model used is simplistic and may under- 
estimate the true variance (Gaztafiaga, Fosalba & Elizalde 
1999; Hui & Gaztafiaga 1999). Szapudi et al. (2000) have 
measured S3 from the IRAS PSCz survey, using the same 
techniques employed in this paper, and find S3 = 0.87 ± 0.48 
for cells of side / = 37/i~^Mpc, which is in good agreement 
with the value we find, quoted in Table 1. 

We compare our measurements of S3 with the values in- 
ferred from the parent angular catalogues of the redshift sur- 
veys in Fig H (Gaztafiaga 1994; Szapudi, Meiksin & Nichol 
1996). The results from the angular catalogues are obtained 
by first finding the projected count distribution on the sky. 
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Figure 3. A comparison of the minimum variance measurements of the skewness listed in Table 1 with the skewness obtained from 
A'^body simulations. In each panel, the filled circles show the skewness measured in the Durham/UKST Survey and the open circles show 
Stromlo-APM Survey results. The light lines show the linear perturbation theory predictions for 53 in real space and are reproduced in 
each panel; the solid line shows the skewness for a power spectrum with F = 0.2, and the dashed line shows the result for T = 0.5. The 
heavy lines show the simulation results and the dotted lines show the error on the mean over five realisations of the initial density field. 
The heavy dashed (solid) lines show the skewness measured in real (redshift) space. The simulation outputs are described by the following 
sets of power spectrum parameters: (a) F = 0.2 and erg = 1, (b) F = 0.5 and trg = 0.66 and (c) F = 0.5 and erg = 1. 



and then applying a deprojection algorithm to infer the mo- 
ments in three dimensions. The algorithm requires knowl- 
edge of the survey selection function. The deprojected an- 
gular measurements are in real space as they are free from 
any distortion due to the peculiar motions of galaxies. On 
large scales, the mean value we find for Ss is below that 
found in real space. However, the errors are large on both 
measurements, and the results are consistent at the la level. 
Moreover, it is somewhat unclear exactly how important edge 
effects in the angular measurements and systematic effects in 
the deprojection technique are on these large scales (Szapudi, 
Meiksin & Nichol 1996; Gaztanaga & Baugh 1998, Gaztanaga 
& Bernardeau 1998; Szapudi & Gaztanaga 1998). 

On small and intermediate scales, I < 15/i~^Mpc, our de- 
terminations are below those obtained from the angular cat- 
alogues. This is due to redshift space distortions. The same 
qualitative behaviour is seen for S3 measured in real space 
and redshift space in numerical simulations of hierarchical 
clustering. In Fig. |^, we compare 83 measured in the A''-body 
simulations used by Gaztafiaga & Baugh (1995), which are 
representative of the behaviour in CDM models, with the 
redshift survey results. The heavy dashed lines in each panel 
show S3 in real space, and the heavy solid lines show S3 in- 
cluding the efi^ects of the peculiar motions of the dark matter. 
The dotted lines show the error on the mean obtained over 
five realisations of the initial conditions (the box size of the 
simulations is 378/i~^Mpc). Two different power spectra are 
considered: panel (a) shows a model with F = 0.2 and (b) and 
(c) show a model with F — 0.5 at two different epochs. On 
large scales, the value of S3 depends upon the shape of the 
power spectrum and is in good agreement with the perturba- 
tion theory predictions, which are shown by the light lines; 
this result was discussed by Gaztafiaga & Baugh (1995). The 



value of S3 in redshift space also depends upon the shape of 
the power spectrum, and is insensitive to epoch or equiva- 
lently to the amplitude of the fluctuations, as shown by Figs, 
^(b) and (c). The real and redshift space values of S3 become 
consistent at / « 20^~^Mpc, in excellent agreement with the 
comparison presented for the data in Fig. GI 

We now investigate how the predictions from the simu- 
lations can be reconciled with the observations and discuss 
the implications for biasing. The model developed by Fry & 
Gaztafiaga (1993) predicts a relationship between the skew- 
ness in the galaxy distribution, Sf^', and that in the under- 
lying dark matter, Ss"^, that is applicable on large scales 
(equation |l]). The variance for the dark matter in the simu- 
lation with F = 0.2 and erg = 1 is very close to the observed 
variance in galaxy counts (cf. the solid line in Fig ^), indi- 
cating that a relatively small linear bias term is required; at 
/ ~ 20/i~^Mpc, the linear bias is 6 = 1.16±0.06. Furthermore, 
in redshift space, the linear bias is essentially independent of 
scale. Thus, given the scale independence of the skewness 
that we measure for galaxies and which is predicted for the 
dark matter from the simulations, we can insert the values 
for Sf', S'J'^ and b into equation |^ and obtain a value for 
the second order bias term, 62. At i ~ 20/!.~^Mpc, a second 
order bias term of value 62 = —0.20 ± 0.14 is required for the 
skewness of the dark matter to match that seen for galaxies. 
For the simulation with F = 0.5 and erg = 0.66, the linear bias 
term is larger (cf. the dotted line in Fig. 0), b = 1.86 ± 0.10, 
and the second order bias term is 62 ~ 1-0 ± 0.4. Hence, 
whilst a linear bias term is sufficient to reconcile the variance 
measured in redshift space for galaxies and for dark matter, 
additional bias terms are required to match up the results for 
the skewness. 

A similar counts in cells analysis has been applied to 
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the PSCz Survey, and yields values for S3 that are in good 
agreement with those reported here at all scales (Szapudi et 
al. 2000). At first sight this result is intriguing, in view of 
the well known difference in the amplitude of the two-point 
functions of optical and infra-red selected galaxies on large 
scales (e.g. Peacock 1997; Hoyle et al. 1999). Thus having 
demonstrated the need to consider a second order bias term in 
addition to the linear bias usually discussed, it would appear 
that both these quantities can depend on the way in which 
galaxies are selected. These issues are best addressed using 
semi-analytic models of galaxy formation (Baugh et al. 2000). 
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